Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 7503 |
| Missing cells | 83 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 586.3 KiB |
| Average record size in memory | 80.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 1 |
longitude is highly overall correlated with latitude and 1 other fields | High correlation |
latitude is highly overall correlated with longitude and 1 other fields | High correlation |
total_rooms is highly overall correlated with total_bedrooms and 2 other fields | High correlation |
total_bedrooms is highly overall correlated with total_rooms and 2 other fields | High correlation |
population is highly overall correlated with total_rooms and 2 other fields | High correlation |
households is highly overall correlated with total_rooms and 2 other fields | High correlation |
median_income is highly overall correlated with median_house_value | High correlation |
median_house_value is highly overall correlated with median_income | High correlation |
ocean_proximity is highly overall correlated with longitude and 1 other fields | High correlation |
total_bedrooms has 82 (1.1%) missing values | Missing |
Reproduction
| Analysis started | 2023-06-12 10:37:07.006142 |
|---|---|
| Analysis finished | 2023-06-12 10:37:45.137970 |
| Duration | 38.13 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
longitude
Real number (ℝ)
| Distinct | 534 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -119.50729 |
| Minimum | -124.35 |
|---|---|
| Maximum | -114.55 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 7503 |
| Negative (%) | 100.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | -124.35 |
|---|---|
| 5-th percentile | -122.3 |
| Q1 | -121.79 |
| median | -118.44 |
| Q3 | -118.22 |
| 95-th percentile | -117.9 |
| Maximum | -114.55 |
| Range | 9.8 |
| Interquartile range (IQR) | 3.57 |
Descriptive statistics
| Standard deviation | 1.8357769 |
|---|---|
| Coefficient of variation (CV) | -0.015361213 |
| Kurtosis | -0.66416901 |
| Mean | -119.50729 |
| Median Absolute Deviation (MAD) | 0.45 |
| Skewness | -0.68719835 |
| Sum | -896663.22 |
| Variance | 3.370077 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| -118.3 | 133 | 1.8% |
| -118.28 | 131 | 1.7% |
| -118.29 | 124 | 1.7% |
| -118.31 | 123 | 1.6% |
| -118.27 | 120 | 1.6% |
| -118.25 | 111 | 1.5% |
| -118.26 | 105 | 1.4% |
| -118.43 | 97 | 1.3% |
| -118.44 | 90 | 1.2% |
| -118.24 | 86 | 1.1% |
| Other values (524) | 6383 |
| Value | Count | Frequency (%) |
| -124.35 | 1 | < 0.1% |
| -124.3 | 2 | < 0.1% |
| -124.27 | 1 | < 0.1% |
| -124.26 | 1 | < 0.1% |
| -124.25 | 1 | < 0.1% |
| -124.23 | 3 | |
| -124.22 | 1 | < 0.1% |
| -124.21 | 3 | |
| -124.19 | 4 | |
| -124.18 | 6 |
| Value | Count | Frequency (%) |
| -114.55 | 1 | < 0.1% |
| -114.63 | 1 | < 0.1% |
| -114.65 | 1 | < 0.1% |
| -114.66 | 1 | < 0.1% |
| -114.73 | 1 | < 0.1% |
| -114.98 | 1 | < 0.1% |
| -115.32 | 1 | < 0.1% |
| -115.37 | 4 | |
| -115.38 | 2 | |
| -115.39 | 1 | < 0.1% |
latitude
Real number (ℝ)
| Distinct | 580 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.608879 |
| Minimum | 32.67 |
|---|---|
| Maximum | 41.95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 32.67 |
|---|---|
| 5-th percentile | 33.931 |
| Q1 | 34.05 |
| median | 34.2 |
| Q3 | 37.71 |
| 95-th percentile | 39.09 |
| Maximum | 41.95 |
| Range | 9.28 |
| Interquartile range (IQR) | 3.66 |
Descriptive statistics
| Standard deviation | 1.9905447 |
|---|---|
| Coefficient of variation (CV) | 0.055900234 |
| Kurtosis | -0.65479909 |
| Mean | 35.608879 |
| Median Absolute Deviation (MAD) | 0.25 |
| Skewness | 0.78043287 |
| Sum | 267173.42 |
| Variance | 3.9622681 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34.05 | 200 | 2.7% |
| 34.06 | 185 | 2.5% |
| 34.04 | 172 | 2.3% |
| 34.07 | 171 | 2.3% |
| 34.08 | 165 | 2.2% |
| 34.1 | 152 | 2.0% |
| 34.09 | 149 | 2.0% |
| 34.02 | 146 | 1.9% |
| 34.03 | 142 | 1.9% |
| 33.99 | 138 | 1.8% |
| Other values (570) | 5883 |
| Value | Count | Frequency (%) |
| 32.67 | 5 | |
| 32.68 | 3 | < 0.1% |
| 32.69 | 3 | < 0.1% |
| 32.7 | 1 | < 0.1% |
| 32.73 | 2 | < 0.1% |
| 32.74 | 2 | < 0.1% |
| 32.75 | 3 | < 0.1% |
| 32.76 | 3 | < 0.1% |
| 32.77 | 1 | < 0.1% |
| 32.78 | 10 |
| Value | Count | Frequency (%) |
| 41.95 | 1 | |
| 41.92 | 1 | |
| 41.88 | 1 | |
| 41.84 | 1 | |
| 41.81 | 1 | |
| 41.8 | 2 | |
| 41.78 | 1 | |
| 41.77 | 1 | |
| 41.76 | 1 | |
| 41.75 | 2 |
housing_median_age
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.325337 |
| Minimum | 1 |
|---|---|
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 24 |
| median | 34 |
| Q3 | 41 |
| 95-th percentile | 52 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 12.082609 |
|---|---|
| Coefficient of variation (CV) | 0.3737814 |
| Kurtosis | -0.6488065 |
| Mean | 32.325337 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -0.2694662 |
| Sum | 242537 |
| Variance | 145.98945 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 52 | 582 | 7.8% |
| 36 | 380 | 5.1% |
| 35 | 376 | 5.0% |
| 34 | 301 | 4.0% |
| 33 | 255 | 3.4% |
| 37 | 234 | 3.1% |
| 32 | 227 | 3.0% |
| 42 | 204 | 2.7% |
| 39 | 195 | 2.6% |
| 43 | 188 | 2.5% |
| Other values (42) | 4561 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 9 | 0.1% |
| 3 | 12 | 0.2% |
| 4 | 37 | |
| 5 | 44 | |
| 6 | 34 | |
| 7 | 37 | |
| 8 | 54 | |
| 9 | 49 | |
| 10 | 70 |
| Value | Count | Frequency (%) |
| 52 | 582 | |
| 51 | 26 | 0.3% |
| 50 | 85 | 1.1% |
| 49 | 76 | 1.0% |
| 48 | 94 | 1.3% |
| 47 | 118 | 1.6% |
| 46 | 140 | 1.9% |
| 45 | 152 | 2.0% |
| 44 | 184 | 2.5% |
| 43 | 188 | 2.5% |
total_rooms
Real number (ℝ)
| Distinct | 3656 |
|---|---|
| Distinct (%) | 48.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2366.7754 |
| Minimum | 2 |
|---|---|
| Maximum | 32054 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 588.1 |
| Q1 | 1341 |
| median | 1932 |
| Q3 | 2823.5 |
| 95-th percentile | 5548 |
| Maximum | 32054 |
| Range | 32052 |
| Interquartile range (IQR) | 1482.5 |
Descriptive statistics
| Standard deviation | 1877.7956 |
|---|---|
| Coefficient of variation (CV) | 0.79339831 |
| Kurtosis | 32.635209 |
| Mean | 2366.7754 |
| Median Absolute Deviation (MAD) | 692 |
| Skewness | 4.075463 |
| Sum | 17757916 |
| Variance | 3526116.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1745 | 12 | 0.2% |
| 1463 | 10 | 0.1% |
| 1613 | 10 | 0.1% |
| 1513 | 10 | 0.1% |
| 1788 | 9 | 0.1% |
| 1287 | 9 | 0.1% |
| 1582 | 9 | 0.1% |
| 1527 | 8 | 0.1% |
| 1438 | 8 | 0.1% |
| 2225 | 8 | 0.1% |
| Other values (3646) | 7410 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 15 | 1 | |
| 18 | 2 | |
| 21 | 1 | |
| 22 | 2 | |
| 24 | 1 | |
| 32 | 1 | |
| 36 | 2 |
| Value | Count | Frequency (%) |
| 32054 | 1 | |
| 28258 | 1 | |
| 27700 | 1 | |
| 21533 | 1 | |
| 20354 | 1 | |
| 19059 | 1 | |
| 18690 | 1 | |
| 18634 | 1 | |
| 18448 | 1 | |
| 17820 | 1 |
total_bedrooms
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 1399 |
|---|---|
| Distinct (%) | 18.9% |
| Missing | 82 |
| Missing (%) | 1.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 505.90742 |
| Minimum | 2 |
|---|---|
| Maximum | 5290 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 137 |
| Q1 | 286 |
| median | 411 |
| Q3 | 608 |
| 95-th percentile | 1186 |
| Maximum | 5290 |
| Range | 5288 |
| Interquartile range (IQR) | 322 |
Descriptive statistics
| Standard deviation | 379.51557 |
|---|---|
| Coefficient of variation (CV) | 0.75016802 |
| Kurtosis | 16.994557 |
| Mean | 505.90742 |
| Median Absolute Deviation (MAD) | 146 |
| Skewness | 3.0754402 |
| Sum | 3754339 |
| Variance | 144032.07 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 460 | 26 | 0.3% |
| 280 | 25 | 0.3% |
| 399 | 25 | 0.3% |
| 290 | 23 | 0.3% |
| 309 | 23 | 0.3% |
| 246 | 23 | 0.3% |
| 295 | 23 | 0.3% |
| 313 | 22 | 0.3% |
| 289 | 22 | 0.3% |
| 318 | 21 | 0.3% |
| Other values (1389) | 7188 | |
| (Missing) | 82 | 1.1% |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 4 | |
| 7 | 5 | |
| 8 | 3 | |
| 9 | 3 | |
| 10 | 2 | < 0.1% |
| 11 | 4 |
| Value | Count | Frequency (%) |
| 5290 | 1 | |
| 4457 | 1 | |
| 4183 | 1 | |
| 4179 | 1 | |
| 3984 | 1 | |
| 3864 | 1 | |
| 3493 | 1 | |
| 3298 | 1 | |
| 3179 | 1 | |
| 3079 | 1 |
population
Real number (ℝ)
| Distinct | 2697 |
|---|---|
| Distinct (%) | 35.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1383.2884 |
| Minimum | 3 |
|---|---|
| Maximum | 15507 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 366 |
| Q1 | 789 |
| median | 1143 |
| Q3 | 1692.5 |
| 95-th percentile | 3199 |
| Maximum | 15507 |
| Range | 15504 |
| Interquartile range (IQR) | 903.5 |
Descriptive statistics
| Standard deviation | 1005.5996 |
|---|---|
| Coefficient of variation (CV) | 0.72696304 |
| Kurtosis | 21.282855 |
| Mean | 1383.2884 |
| Median Absolute Deviation (MAD) | 415 |
| Skewness | 3.1748619 |
| Sum | 10378813 |
| Variance | 1011230.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1005 | 16 | 0.2% |
| 788 | 14 | 0.2% |
| 861 | 11 | 0.1% |
| 911 | 11 | 0.1% |
| 825 | 11 | 0.1% |
| 835 | 11 | 0.1% |
| 753 | 11 | 0.1% |
| 850 | 11 | 0.1% |
| 986 | 11 | 0.1% |
| 761 | 11 | 0.1% |
| Other values (2687) | 7385 |
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 6 | 1 | |
| 8 | 1 | |
| 13 | 1 | |
| 14 | 1 | |
| 15 | 1 | |
| 17 | 1 | |
| 18 | 1 | |
| 20 | 2 | |
| 21 | 1 |
| Value | Count | Frequency (%) |
| 15507 | 1 | |
| 15037 | 1 | |
| 12203 | 1 | |
| 10988 | 1 | |
| 9671 | 1 | |
| 9427 | 1 | |
| 9135 | 1 | |
| 8997 | 1 | |
| 8907 | 1 | |
| 8768 | 1 |
households
Real number (ℝ)
| Distinct | 1324 |
|---|---|
| Distinct (%) | 17.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 474.59536 |
| Minimum | 2 |
|---|---|
| Maximum | 5050 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 126 |
| Q1 | 272 |
| median | 388 |
| Q3 | 568.5 |
| 95-th percentile | 1107.9 |
| Maximum | 5050 |
| Range | 5048 |
| Interquartile range (IQR) | 296.5 |
Descriptive statistics
| Standard deviation | 352.98647 |
|---|---|
| Coefficient of variation (CV) | 0.74376301 |
| Kurtosis | 17.293959 |
| Mean | 474.59536 |
| Median Absolute Deviation (MAD) | 137 |
| Skewness | 3.0741277 |
| Sum | 3560889 |
| Variance | 124599.45 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 306 | 25 | 0.3% |
| 334 | 25 | 0.3% |
| 311 | 25 | 0.3% |
| 380 | 24 | 0.3% |
| 340 | 24 | 0.3% |
| 292 | 24 | 0.3% |
| 329 | 24 | 0.3% |
| 295 | 23 | 0.3% |
| 277 | 23 | 0.3% |
| 269 | 23 | 0.3% |
| Other values (1314) | 7263 |
| Value | Count | Frequency (%) |
| 2 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 3 | |
| 7 | 7 | |
| 8 | 2 | < 0.1% |
| 9 | 3 | |
| 10 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
| 13 | 4 |
| Value | Count | Frequency (%) |
| 5050 | 1 | |
| 4204 | 1 | |
| 4072 | 1 | |
| 3701 | 1 | |
| 3595 | 1 | |
| 3302 | 1 | |
| 3293 | 1 | |
| 3262 | 1 | |
| 3061 | 1 | |
| 2902 | 1 |
median_income
Real number (ℝ)
| Distinct | 5620 |
|---|---|
| Distinct (%) | 74.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5970367 |
| Minimum | 0.4999 |
|---|---|
| Maximum | 15.0001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 0.4999 |
|---|---|
| 5-th percentile | 1.4625 |
| Q1 | 2.2948 |
| median | 3.1818 |
| Q3 | 4.35115 |
| 95-th percentile | 7.13832 |
| Maximum | 15.0001 |
| Range | 14.5002 |
| Interquartile range (IQR) | 2.05635 |
Descriptive statistics
| Standard deviation | 1.9180808 |
|---|---|
| Coefficient of variation (CV) | 0.53323915 |
| Kurtosis | 6.1417389 |
| Mean | 3.5970367 |
| Median Absolute Deviation (MAD) | 0.9899 |
| Skewness | 1.9153888 |
| Sum | 26988.566 |
| Variance | 3.6790339 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.125 | 24 | 0.3% |
| 2.875 | 22 | 0.3% |
| 3.875 | 21 | 0.3% |
| 15.0001 | 21 | 0.3% |
| 2.625 | 19 | 0.3% |
| 4.125 | 17 | 0.2% |
| 3.375 | 16 | 0.2% |
| 3.25 | 15 | 0.2% |
| 1.625 | 15 | 0.2% |
| 4.375 | 14 | 0.2% |
| Other values (5610) | 7319 |
| Value | Count | Frequency (%) |
| 0.4999 | 8 | |
| 0.536 | 4 | |
| 0.5495 | 1 | < 0.1% |
| 0.6775 | 1 | < 0.1% |
| 0.6831 | 1 | < 0.1% |
| 0.6991 | 1 | < 0.1% |
| 0.716 | 1 | < 0.1% |
| 0.7286 | 1 | < 0.1% |
| 0.7403 | 1 | < 0.1% |
| 0.7473 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 15.0001 | 21 | |
| 15 | 1 | < 0.1% |
| 14.2867 | 1 | < 0.1% |
| 13.947 | 1 | < 0.1% |
| 13.6842 | 1 | < 0.1% |
| 13.5728 | 1 | < 0.1% |
| 13.499 | 1 | < 0.1% |
| 13.4883 | 1 | < 0.1% |
| 13.4196 | 1 | < 0.1% |
| 13.2949 | 1 | < 0.1% |
median_house_value
Real number (ℝ)
| Distinct | 2814 |
|---|---|
| Distinct (%) | 37.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 194794.83 |
| Minimum | 12 |
|---|---|
| Maximum | 500001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 60210 |
| Q1 | 110350 |
| median | 169300 |
| Q3 | 243200 |
| 95-th percentile | 456210 |
| Maximum | 500001 |
| Range | 499989 |
| Interquartile range (IQR) | 132850 |
Descriptive statistics
| Standard deviation | 112512.7 |
|---|---|
| Coefficient of variation (CV) | 0.57759592 |
| Kurtosis | 0.71889417 |
| Mean | 194794.83 |
| Median Absolute Deviation (MAD) | 64600 |
| Skewness | 1.1034988 |
| Sum | 1.4615456 × 109 |
| Variance | 1.2659107 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500001 | 298 | 4.0% |
| 162500 | 44 | 0.6% |
| 112500 | 42 | 0.6% |
| 137500 | 41 | 0.5% |
| 187500 | 37 | 0.5% |
| 225000 | 34 | 0.5% |
| 350000 | 32 | 0.4% |
| 175000 | 30 | 0.4% |
| 87500 | 28 | 0.4% |
| 150000 | 27 | 0.4% |
| Other values (2804) | 6890 |
| Value | Count | Frequency (%) |
| 12 | 1 | |
| 14999 | 2 | |
| 17500 | 1 | |
| 22500 | 1 | |
| 25000 | 1 | |
| 26600 | 1 | |
| 26900 | 1 | |
| 30000 | 2 | |
| 32500 | 1 | |
| 32900 | 1 |
| Value | Count | Frequency (%) |
| 500001 | 298 | |
| 500000 | 5 | 0.1% |
| 499000 | 1 | < 0.1% |
| 498700 | 1 | < 0.1% |
| 498600 | 1 | < 0.1% |
| 498400 | 1 | < 0.1% |
| 497600 | 1 | < 0.1% |
| 497400 | 1 | < 0.1% |
| 495600 | 2 | < 0.1% |
| 495500 | 1 | < 0.1% |
ocean_proximity
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 58.7 KiB |
| <1H OCEAN | |
|---|---|
| INLAND | |
| NEAR BAY | |
| NEAR OCEAN | 173 |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.9765396 |
| Min length | 6 |
Characters and Unicode
| Total characters | 59840 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NEAR OCEAN |
|---|---|
| 2nd row | NEAR OCEAN |
| 3rd row | NEAR OCEAN |
| 4th row | NEAR OCEAN |
| 5th row | NEAR OCEAN |
Common Values
| Value | Count | Frequency (%) |
| <1H OCEAN | 3854 | |
| INLAND | 2188 | |
| NEAR BAY | 1287 | 17.2% |
| NEAR OCEAN | 173 | 2.3% |
| (Missing) | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ocean | 4027 | |
| 1h | 3854 | |
| inland | 2188 | |
| near | 1460 | 11.4% |
| bay | 1287 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 9863 | |
| A | 8962 | |
| E | 5487 | |
| 5314 | ||
| O | 4027 | |
| C | 4027 | |
| < | 3854 | 6.4% |
| 1 | 3854 | 6.4% |
| H | 3854 | 6.4% |
| I | 2188 | 3.7% |
| Other values (5) | 8410 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 46818 | |
| Space Separator | 5314 | 8.9% |
| Math Symbol | 3854 | 6.4% |
| Decimal Number | 3854 | 6.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 9863 | |
| A | 8962 | |
| E | 5487 | |
| O | 4027 | |
| C | 4027 | |
| H | 3854 | 8.2% |
| I | 2188 | 4.7% |
| L | 2188 | 4.7% |
| D | 2188 | 4.7% |
| R | 1460 | 3.1% |
| Other values (2) | 2574 | 5.5% |
Space Separator
| Value | Count | Frequency (%) |
| 5314 |
Math Symbol
| Value | Count | Frequency (%) |
| < | 3854 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3854 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 46818 | |
| Common | 13022 | 21.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 9863 | |
| A | 8962 | |
| E | 5487 | |
| O | 4027 | |
| C | 4027 | |
| H | 3854 | 8.2% |
| I | 2188 | 4.7% |
| L | 2188 | 4.7% |
| D | 2188 | 4.7% |
| R | 1460 | 3.1% |
| Other values (2) | 2574 | 5.5% |
Common
| Value | Count | Frequency (%) |
| 5314 | ||
| < | 3854 | |
| 1 | 3854 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 59840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 9863 | |
| A | 8962 | |
| E | 5487 | |
| 5314 | ||
| O | 4027 | |
| C | 4027 | |
| < | 3854 | 6.4% |
| 1 | 3854 | 6.4% |
| H | 3854 | 6.4% |
| I | 2188 | 3.7% |
| Other values (5) | 8410 |
| longitude | latitude | housing_median_age | total_rooms | total_bedrooms | population | households | median_income | median_house_value | ocean_proximity | |
|---|---|---|---|---|---|---|---|---|---|---|
| longitude | 1.000 | -0.839 | 0.090 | -0.058 | 0.020 | 0.181 | 0.042 | -0.005 | 0.139 | 0.724 |
| latitude | -0.839 | 1.000 | -0.209 | 0.121 | 0.001 | -0.160 | -0.030 | 0.068 | -0.188 | 0.772 |
| housing_median_age | 0.090 | -0.209 | 1.000 | -0.332 | -0.314 | -0.282 | -0.286 | -0.088 | 0.098 | 0.232 |
| total_rooms | -0.058 | 0.121 | -0.332 | 1.000 | 0.902 | 0.767 | 0.898 | 0.316 | 0.279 | 0.017 |
| total_bedrooms | 0.020 | 0.001 | -0.314 | 0.902 | 1.000 | 0.870 | 0.980 | 0.021 | 0.169 | 0.042 |
| population | 0.181 | -0.160 | -0.282 | 0.767 | 0.870 | 1.000 | 0.897 | -0.023 | 0.073 | 0.088 |
| households | 0.042 | -0.030 | -0.286 | 0.898 | 0.980 | 0.897 | 1.000 | 0.057 | 0.203 | 0.047 |
| median_income | -0.005 | 0.068 | -0.088 | 0.316 | 0.021 | -0.023 | 0.057 | 1.000 | 0.649 | 0.094 |
| median_house_value | 0.139 | -0.188 | 0.098 | 0.279 | 0.169 | 0.073 | 0.203 | 0.649 | 1.000 | 0.331 |
| ocean_proximity | 0.724 | 0.772 | 0.232 | 0.017 | 0.042 | 0.088 | 0.047 | 0.094 | 0.331 | 1.000 |
| longitude | latitude | housing_median_age | total_rooms | total_bedrooms | population | households | median_income | median_house_value | ocean_proximity | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | -124.35 | 40.54 | 52 | 1820 | 300.0 | 806 | 270 | 3.0147 | 94600 | NEAR OCEAN |
| 1 | -124.30 | 41.80 | 19 | 2672 | 552.0 | 1298 | 478 | 1.9797 | 85800 | NEAR OCEAN |
| 2 | -124.30 | 41.84 | 17 | 2677 | 531.0 | 1244 | 456 | 3.0313 | 103600 | NEAR OCEAN |
| 3 | -124.27 | 40.69 | 36 | 2349 | 528.0 | 1194 | 465 | 2.5179 | 79000 | NEAR OCEAN |
| 4 | -124.26 | 40.58 | 52 | 2217 | 394.0 | 907 | 369 | 2.3571 | 111400 | NEAR OCEAN |
| 5 | -124.25 | 40.28 | 32 | 1430 | 419.0 | 434 | 187 | 1.9417 | 76100 | NEAR OCEAN |
| 6 | -124.23 | 41.75 | 11 | 3159 | 616.0 | 1343 | 479 | 2.4805 | 73200 | NEAR OCEAN |
| 7 | -124.23 | 40.81 | 52 | 1112 | 209.0 | 544 | 172 | 3.3462 | 50800 | NEAR OCEAN |
| 8 | -124.23 | 40.54 | 52 | 2694 | 453.0 | 1152 | 435 | 3.0806 | 106700 | NEAR OCEAN |
| 9 | -124.22 | 41.73 | 28 | 3003 | 699.0 | 1530 | 653 | 1.7038 | 78300 | NEAR OCEAN |
| longitude | latitude | housing_median_age | total_rooms | total_bedrooms | population | households | median_income | median_house_value | ocean_proximity | |
|---|---|---|---|---|---|---|---|---|---|---|
| 7493 | -115.37 | 32.82 | 14 | 1276 | 270.0 | 867 | 261 | 1.9375 | 80900 | INLAND |
| 7494 | -115.37 | 32.82 | 30 | 1602 | 322.0 | 1130 | 335 | 3.5735 | 71100 | INLAND |
| 7495 | -115.37 | 32.81 | 32 | 741 | 191.0 | 623 | 169 | 1.7604 | 68600 | INLAND |
| 7496 | -115.32 | 32.82 | 34 | 591 | 139.0 | 327 | 89 | 3.6528 | 100000 | INLAND |
| 7497 | -114.98 | 33.07 | 18 | 1183 | 363.0 | 374 | 127 | 3.1607 | 57500 | INLAND |
| 7498 | -114.73 | 33.43 | 24 | 796 | 243.0 | 227 | 139 | 0.8964 | 59200 | INLAND |
| 7499 | -114.66 | 32.74 | 17 | 1388 | 386.0 | 775 | 320 | 1.2049 | 44000 | INLAND |
| 7500 | -114.65 | 32.79 | 21 | 44 | 33.0 | 64 | 27 | 0.8571 | 25000 | INLAND |
| 7501 | -114.63 | 32.76 | 15 | 1448 | 378.0 | 949 | 300 | 0.8585 | 45000 | INLAND |
| 7502 | -114.55 | 32.80 | 19 | 2570 | 820.0 | 1431 | 608 | 1.2750 | 56100 | INLAND |